Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
146285
posts in
46.8
ms
Learning to
maintain
safety through expert
demonstrations
in settings with unknown constraints: A Q-learning perspective
arxiv.org
·
20h
📊
Dynamic Programming
Recurrent
Neural Networks (
RNN
) From Scratch: The First Step Toward Modern LLMs
pub.towardsai.net
·
4h
🤖
TVM
Stochastic
Gradient Methods: Bias, Stability and
Generalization
jmlr.org
·
13h
📊
Optimization
Less is more -- the
Dispatcher
/
Executor
principle for multi-task Reinforcement Learning
arxiv.org
·
20h
💬
Prompt Engineering
Show HN:
StrategicConsult
– Game theory
augmented
AI for decision making
negotiatecash.com
·
1h
·
Discuss:
Hacker News
🎮
Game Theory
Tabular
representation
learning
breno.bearblog.dev
·
12h
🎨
Chroma
Apollo
DQN
: Building an RL Agent for
LunarLander-v3
pub.towardsai.net
·
4h
🎲
Deterministic Simulation
A
Reinforcement
Learning Approach in Multi-Phase Second-Price
Auction
Design
jmlr.org
·
13h
📊
Dynamic Programming
Start Big Learn
Along
The Way
dev.to
·
9h
·
Discuss:
DEV
💬
Prompt Engineering
tfatykhov/nous
: Agent with decisions memory at its core
github.com
·
19h
·
Discuss:
Hacker News
⚓
Anchors
Your Brain
Tracks
Rewards
In Real Time, And It Can Make You Move Faster
studyfinds.com
·
11h
🧠
Cognitive Science
Unifying
non-Markovian
dynamics and agent
heterogeneity
in scalable stochastic networks
nature.com
·
9h
🔲
Cellular Automata
Building
specialized
AI without
sacrificing
intelligence: Nova Forge data mixing in action
aws.amazon.com
·
5h
🌀
Naiad
LEARN
uphillathlete.com
·
9h
🎴
Anki
Ask HN:
Statistical
learning and
non-Statistical
learning for
humans
news.ycombinator.com
·
11h
·
Discuss:
Hacker News
🧠
Machine Learning
Forms
of
Forgetting
psychologytoday.com
·
5h
🎴
Anki
Stochastic
Kernel-Switching
Error Diffusion
blog.kaetemi.be
·
11h
📈
Delta Encoding
Industrial-Grade
Physical
AI Systems
trendhunter.com
·
10h
🛡️
AI Security
How to Build
Autonomous
Data Systems for Real-Time
Decisioning
confluent.io
·
3h
⏰
Timely Dataflow
The
Architecture
Behind Open-Source LLMs
blog.bytebytego.com
·
8h
🦙
Ollama
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help